High Dimensional Dataset Compression Using Principal Components
Authors
Abstract
Until recently, computational power was insufficient to diagonalize atmospheric datasets of order 10^10 elements. Eigenanalysis of tens of thousands of variables can now achieve massive data compression for spatial fields with strong correlation properties. Applying eigenanalysis to 26,394 variable dimensions for three severe weather datasets (tornado, hail and wind) retains 9 to 11 principal components that explain 42% to 52% of the variability. Rotated principal components (RPCs) detect localized, coherent structures in the data variance for each outbreak type and are related to standardized anomalies of the meteorological fields. Our analyses of the RPC loadings and scores show that graphical displays of these quantities can efficiently reduce and interpret large datasets. Data are analyzed 24 hours prior to severe weather as a forecasting aid. RPC loadings of the sea-level pressure fields show a different morphology for each outbreak type. Analysis of the low-level moisture and temperature RPCs suggests that the moisture fields for hail and wind outbreaks are more closely related to each other than to those for tornado outbreaks. Consequently, these patterns can identify precursors of severe weather and discriminate between tornadic and non-tornadic outbreaks.
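As an illustration of the compression step the abstract describes, the following is a minimal sketch and not the authors' procedure. It assumes NumPy, a synthetic data matrix in place of the meteorological fields, and hypothetical helper names (`pca_compress`, `pca_reconstruct`). It uses a thin SVD of the centered data, which gives the same principal components as diagonalizing the covariance matrix without forming that matrix explicitly. The rotation step that produces RPCs is not shown.

```python
import numpy as np


def pca_compress(X, k):
    """Return scores, loadings, mean and explained-variance fraction for k PCs.

    X has shape (n_cases, n_variables). Hypothetical helper for illustration.
    """
    mean = X.mean(axis=0)
    Xc = X - mean  # center each variable
    # Thin SVD: rows of Vt are the eigenvectors of the covariance matrix.
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = U[:, :k] * s[:k]      # coordinates of each case on the k PCs
    loadings = Vt[:k]              # k spatial patterns, one per row
    explained = (s[:k] ** 2).sum() / (s ** 2).sum()
    return scores, loadings, mean, explained


def pca_reconstruct(scores, loadings, mean):
    """Approximate the original data from the retained components."""
    return scores @ loadings + mean


# Synthetic stand-in for a strongly correlated field (assumption):
# 50 cases, 2000 variables, driven by 5 latent patterns plus small noise.
rng = np.random.default_rng(0)
n_cases, n_vars, n_latent = 50, 2000, 5
latent = rng.normal(size=(n_cases, n_latent))
patterns = rng.normal(size=(n_latent, n_vars))
X = latent @ patterns + 0.1 * rng.normal(size=(n_cases, n_vars))

scores, loadings, mean, explained = pca_compress(X, k=5)
X_hat = pca_reconstruct(scores, loadings, mean)

print(f"stored values: {scores.size + loadings.size + mean.size} of {X.size}")
print(f"explained variance fraction: {explained:.3f}")
```

The compressed representation stores the scores, the loadings and the mean instead of the full matrix, so the saving grows with the strength of the correlation between variables.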
Related resources
Compression of Breast Cancer Images By Principal Component Analysis
The principle of dimensionality reduction with PCA is the representation of the dataset 'X' in terms of eigenvectors e_i ∈ R^N of its covariance matrix. The eigenvectors oriented in the directions of maximum variance of X in R^N carry the most relevant information of X. These eigenvectors are called principal components [8]. Ass...
Subspace-Clustering-Based Multispectral Image Compression
This paper describes a subspace clustering strategy for the spectral compression of multispectral images. Unlike standard PCA, this approach finds clusters in different subspaces of different dimension. Consequently, instead of representing all spectra in a single low-dimensional subspace of a fixed dimension, spectral data are assigned to multiple subspaces having a range of dimensions from on...
Reconstruction of Walking People Images by Principal Component Analysis
Principal Component Analysis (PCA) is a useful statistical technique that has found applications in fields such as recognition, classification and image data compression. It is also a common technique for extracting features from data in a high-dimensional space. By linearly transforming the images into eigenspace, we project the images into a new N-dimensional space, which exhibits the proper...
A New Method for Principal Component Analysis of High-Dimensional Data Using Compressive Sensing
Principal Component Analysis of high-dimensional data often runs into time and memory limitations. This is especially the case if the dimension and the number of data set elements are of about the same size. We propose a new method to calculate Principal Components based on Compressive Sensing. Compressive Sensing can be interpreted as a new method for data compression with a number of positive ...